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, Abstract. We prove that random d-regular Cayley graphs of the symmetric group asymp- 

i totically almost surely have girth at least (log^_i 101)^/^/2 and that random d-regular Cay- 

ley graphs of simple algebraic groups over asymptotically almost surely have girth at 
least log^_i \G\/ dim(G). For the symmetric p-groups the girth is between loglog|G| and 
(log IGj)" with a < 1. Several conjectures and open questions are presented. 

(N ■ 



1. Introduction 

The girth of a graph is the length of a shortest cycle. Finite regular graphs of large 
girth are a natural analogue to the infinite tree. While random regular graphs have nice 
expansion properties, their girth tends to be small, as small cycles can appear at many 
^ . places mdependeutly. The objects of study of this paper, random Cayley graphs, overcome 
' this problem. While being random, they are vertex-transitive, giving short cycles fewer 
00 . opportunities to appear. 

^ \ Graphs of large girth. Let g = g{n, d) be the largest possible girth of a rf-regular graph of 

! size at most n. Deriving good bounds on g{n, d) for any d > 3 is a notoriously hard problem. 

O ■ 

If we consider d > 3 fixed and growing n, the best asymptotic estimates known are: 

>< : (1) (2 + 0(1)) ■ log,„i n > g{n, d)>{^- o(l)) ■ log,_i n. 

While it may appear that the problem is essentially solved, the constant factor gap is crucial 
here. Clearly, when considering the inverse of g the constant factor gap becomes an exponent 
gap. Also, it is a small miracle that the lower bound constant 4/3 is greater than 1, see 
Conjecture |5l 

The first inequality in ([T]) is a version of the Moore bound. It is a consequence of a simple 
counting argument stating that a ball of radius [{g — 1)/2J around a vertex (or an edge) is 
a tree, and therefore must have n{(d — 1)^/^) distinct vertices. 

For a family of rf-regular graphs Qi of logarithmic girth, let 'y{{Gi}) = liminfj^oo \og^^^^^\g.\) ■ 
Erdos and Sachs [16j described a simple procedure yielding families of graphs with large 
girth with 7 = 1. The first explicit construction of an infinite degree 4 family with 7 ^ 0.83 
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was given by Margulis [31j, who also gave examples of infinite families with arbitrary large 
degree and 7 ~ 0.44; the constructions in question are Cayley graphs of SL2(Fp). Imrich [20j . 
extending the work of Margulis, constructed a family of Cayley graphs of arbitrary degree 
with 7 ^ 0.48, and cubic graphs with 7 ^ 0.96. A family of geometrically defined cubic 
graphs introduced by Biggs and Hoare was proven to have 7 > 4/3 by Weiss |39j . 

Examples of graphs of arbitrarily large degree satisfying 7 > 4/3 where given by Lubotzky, 
Phillips and Sarnak [2^ and by by Margulis [32]: these are celebrated Ramanujan graphs 
X^'"^ - Cayley graphs of PGL2{q) with respect to a very special choice of {p + 1) generators, 
where p and q are primes congruent to —1 mod p with the Legendre symbol = — 1. A 
similar result was obtained by Morgenstern [M] for any prime power in place of p. Biggs and 
Bosher [in] proved that the constant 7 = | for X^''^ is essentially the best possible, namely 
they showed that 

girth(XP''?) < 4 logp q + log^ 4 + 2. 

For every prime power q Lazebnik, Ustimenko, and Woldar [27] constructed families of q- 
regular graphs with 7 > | logg(g — 1). 

Random Cayley graphs. Let G be a finite group and let S G G. The (undirected) Cayley 
graph Q{G, S) is the undirected graph with the vertex set G and the edge set {{g,gs) : g G 
G,s E SU S^^}. Given some group G, a random 2fc- regular Cayley graph of G is the Cayley 
graph Q{G,S U S^^) where is a set of k elements from G, selected independently and 
uniformly at random. 

The properties of this model for random graphs received considerable attention in the last 
decade. The expansion of such graphs (for 15*1 growing with IGI) was considered by Alon- 
Roichman [1], Pak [35] and Landau- Russell [25]. The diameter of random Cayley graphs on 
the symmetric group was considered by Babai et al. [8], [6l [7]. 

In this work we consider the girth of random Cayley graphs on various groups. It turns 
out that random Cayley graphs of the symmetric group and of the algebraic groups over 
finite fields, tend to have high girth. This is in contrast to random d-regular graphs that 
tend to have constant girth [211 133] . 

Fixed walks in random graphs. In the classical models for random walks in random 
environment, an environment is created by some random process, then a particle performs 
a random walk on this environment. 
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Normally, there are two ways to look at such walks; quenched properties of the random 
walk are for the typical environment, and annealed properties of the random walk are 
averaged over environments. 

The model of random Cayley graphs allows for a third interpretation, as the random walk 
(a sequence of symbols w = W1W2 ■ ■ - Wn from S) can be fixed in advance of generating the 
random graphs. Thus, for a random Cayley graph model we can always talk about the fixed 
walk on the random graph, which, equivalently, is a random evaluation of the word w. 

All of our girth results are based on bounds on the return probabiHty of fixed walks on 
random graphs. More precisely, given a sequence of groups, we will show that 



See Section [2] for further details and discussion. 

Results in this paper. We study the girth of random Cayley graphs for three natural 
classes of groups. The methods used to prove ([2]) are unique to each class. While the first- 
order asymptotics of the girth is still an open problem in all cases, our results give bounds 
of varying precision. 

The most general such result is a simple corollary of the following theorem, due to Dixon, 
Pyber, Seress and Shalev [H]: 

Theorem 1. [13] Let Gn he a sequence of simple groups with increasing order, and let w he 
a word. Then as n ^ 00, P{w = 1 in Gn) 0. 

Corollary 2. For k random generators, girth(G'„) ^ 00 in prohahility. 

In the case of the symmetric group, we show the following in section [31 
Theorem 3. As n 00, a.a.s. the girth of the d-regular random Cayley graph of Sn is at 



However, we conjecture that the girth is equal to O (log 1 5*^1). 

In section H] we consider families of simple groups of Lie type; in this case representations 
as matrices is very helpful and we can get stronger bounds. 



(2) 




least (1/2 - 0(1)) ■ y/\og,_,\S, 



Theorem 4. As q ^ 00 a.a.s. the girth of the d-regular random Cayley graph of G{¥g), 
where G is a simple group of fixed Lie type and fixed rank overWg is at least (7—0(1)) log£;_i [(^(Fq)! 
with 7 = l/dim(G'). 
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The bound in Theorem H] is optimal except for the crucial constant 7. It should be noted 
that the construction yielding the lower bound in ([T]) is a Cayley graph of PGL2(]5). However, 
it seems from computer experiments (section l4.2p . that such a result (or even a lower bound 
of 1 ■ log^_2^ IGI) is unlikely for a random Cayley graph of PGL2(p). In fact, achieving better 
girth than log^_i \G\ seems to be a barrier for many combinatorial constructions such as the 
result of Erdos and Sachs |16j . 

The 7 = 1 threshold can be obtained as a consequence of the following appealing heuristics. 
Let w{xi, . . . , Xk) be a fixed word in a free group on k generators xi, . . . , x^. Let fwidi, ■ ■ ■ ,gk) 
be an element in G obtained by substituting Xi = Qi- Define 

(3) PG{w)=FToh[Ug^,...,gk) = l]. 

Suppose that for a fixed short words w we have Pg{w) ~ 1/1^1, and that the events [/^^ = 1] 
and [fw2 = 1] are independent for "generic" Wi and W2- Then by counting words we could 
easily get that girth(G')/ log^„i \G\ — 1 as |G| — 00 a.a.s. 

Both of these assumptions are false. It seems that it is not possible to decrease the 
satisfaction probability for a sufficiently large number of words, but it is possible to increase 
it, for example in Abelian groups. 

The independence assumption is not needed for the lower bound (where the union bound 
can be used), but problematic positive correlations arise when one tries to prove upper 
bounds. Yet we believe that for random Cayley graphs, a stronger version of the Moore 
bound holds (with constant 1 instead of 2). 

Conjecture 5. Let be a sequence of groups. As n 00, a.a.s. the girth of the 

d-regular random Cayley graph of Gn is at most (1 + o(l)) log^_i \Gn\. 

The third family of groups we are considering are p-groups, which may be thought of as an 
intermediate class between Abelian groups (where the girth is at most 4) and simple groups 
(where the girth can be logarithmic). This is the only case where we have an upper bound 
better than Moore's. 

The symmetric p-group Wn{p) of height n is the n-fold iterated wreath product of Z/(p Z). 
It is isomorphic to the Sylow p-subgroup of the symmetric group Sym(p"). It plays the role 
analogous to the symmetric group in the realm of p-groups: it is a basic family of groups 
containing all finite p-groups as subgroups. 
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Theorem Q. As n ^ oo, a.a.s. the girth of the d-regular random Cayley graph of the 
symmetric p-group G = Wn{p) satisfies 

(1 - o(l))/51oglog |G| < gn < (1 + o(l))(log 

where a < 1 is a constant depending on p only, and [3 depends on p and d. 

In Section [5l we present the proof of Theorem [6l as weU as heuristics to show why the 
upper bound should be closer to the truth. It turns out that it helps to relate this problem 
to a simple toy model for genetics. 

A basic question in this direction is 

Question 7. Does there exist a sequence ofp-groups of increasing order with random Cayley 
graphs of logarithmic girth ? 

We conclude the paper in section E] by commenting on the analog of large girth property 
in the case of compact Lie groups. 

2. The union bound and worst case analysis 

In all of our proofs we estimate the probability that a given word evaluates to the identity, 
thus creating a short cycle. All of our girth results are based on bounding the probability 
of random elements to satisfy a given word. More precisely, given a sequence of groups G„, 
and a word w m d generators for a fixed c? > 2, we show that 

sup PaSw = l) = o{{d-l)-'-), 

for some sequence £„. Summing over all words we get that 

P(girthG'„ < la) = P{w = 1 in for some \w\ < £„) 

(4) < ^ P(w = 1 in 

(5) < #{w : \w\ < in} sup P{w = 1 in G„) 

\w\<e„ 



i+dj2id-iy]o{{d-i] 



1=0 

= o{l) 

We believe that the sup bound is wasteful; see Remark [T] in the next section. 

The other potentially wasteful part is the union (jl]) bound, which is not far off when 
events are not positively correlated. However, it seems that at least in some cases, there are 
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correlations. For example, consider words w and w' in two generators, a, h. In a significantly 
large portion of such words, the exponent sum of a equals 0. Thus w = w' = 1 iih = 1, giving 
P{w = w' = 1) > l/IG]. Typically, we expect P{w = 1) x and so the uncorrelated 

case would be P{w = = 1) x We don't know how to take advantage of these 

correlations for lower bounds. Moreover, they have blocked our attempts for upper bounds 
on girth via the second moment method. 

2.1. Limits of the union bound. The expression in the union bound (jl]) can be written 
double sum 

\w\<en \w\<e„gi,--;gk 
where 1 is 1 if its argument is true and zero otherwise. Switching the order of summation 
and changing back to probabilities gives 

9iv.9fc 

where w now is a uniform random reduced word of length i. If w was just a uniformly chosen 
word , then P{w = 1) would mean the chance that a random walk on G with generators {gk} 
is at the origin at time n. It is easy to check (and well-known, see [5j p. 139) that for even i this 
probability is at least 1/|G|. Using this it is possible to show that if ^„ > (1 + e) log^^^ \Gn\, 
then (jlj) cannot be o(l), and no proof using the union bound could work to show that the 
girth is at least (1 + e) log^^i \G\. 

2.2. Random evaluation of words. Bounds on P{w = 1) have appeared in the literature 
[13]. A nice bound, using transitivity properties of groups appears in pQ. In [22] it is shown 
that only finitely many finite simple groups satisfy a given non-trivial law w. Word maps 
are studied in [2Sl 1^ - 

It is also natural to ask (in context of the last paragraph) for which groups do we have 
P{w = 1) > l/IG*!. Perhaps surprisingly. It turns out that this is always true for all words in 
one or two generators, but not necessarily for three. See [2] for many counterexamples and 
discussion. 

3. Random Cayley graphs of Sn 

For some k > 2, let Ui, . . . ,0"^ be independent uniform random permutations from Sn- 
Let d = 2k, and S = {cr^^ , . . . , a^^} . We prove for G = C(S'„, 5) that a.a.s girth(G') > 
c • ^yn\ogn/ \og{d — 1) for any constant c < 1/2. 
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Remark 1. The crucial bound used in our proof of Theorem [3] is the upper bound on Pg{w), 
which holds for all non-trivial words of length at most /. We observe that, for the power 
word w = a\ this bound is almost tight. Indeed, Pciw) is at least the probability that the 
first \n/l\ cycles of a random permutation in Sn have length /. Therefore 

\n/l\-l 

Pg{w)> n l/in~tl)>il/n)T. 

2 = 

Therefore in order to improve upon Theorem [3l by more than a constant factor, one needs 
either to avoid the union bound on w or refine the upper bound on Pg{w) to incorporate 
more information on the structure of w. 

Proof of Theorem Our first observation is that the girth of G is the length of the shortest 
non-trivial relation between cxi, . . . , a^. Therefore, girth(G) > g with high probability, if for 
most choices of Ui, . . . , 0"^, no non-trivial word in (Xi, . . . , 0"^ of length smaller than g is the 
identity permutation. Clearly, it suffices to check only non-trivial cyclically reduced words 
of length i < g. Namely words w = sq - ■ ■ si-i, satisfying Si ^ Sj^\ ^^^^^ for 0, . . . ,i — 1. 
We denote the set of such words by Irred^; clearly |Irredg| < {d — 1)^. The probability 
of girth(G') < g is bounded by J^welrred Pg{w), where Pg{w), defined in ([3]) denotes the 
probability that w is the identity permutation. That is Pciw) is the probability that w fixes 
all the n points 1, . . . , n. 

Given a word w = sq - ■ ■ se-i and some starting point xi, we trace the path xi, xiSq, xiSqSi, . . ., 
exposing the necessary entries of the permutations ai, . . . , 0"^ one by one. In order that w will 
fix xi, some coincidence must occur. That is, when exposing the entries of the path starting 
at Xi, there has to be a first time when the path arrives at Xi by some permutation different 
from Sq^. The probability of such an event occurring at any specific step i is bounded by 
— e), where e is the number of entries exposed so far. Since e is at most i, and since 
there are at most £ choices for i, we have Pr[xit(7 = xi] < i/{n — i). 

Suppose that we already verified that w fixes Xi, . . . ,Xm-i by exposing the necessary 
entries. Then we have exposed at most (m — 1) ■ i entries. As long as this number is 
smaller than n, we can choose a point Xm such that no entry involving Xm was exposed yet. 
Repeating the previous argument, yields an upper bound of i/{n — mi) on the probability 
that w fixes Xm, even when conditioning on the previously exposed entries. Therefore the 
probability that w is the identity permutation is bounded by {i/{n — m£))™, as long as 
mi < n. Substituting m = n/{2i), yields the bound 

(6) P{w) < (2£/n)'^/(2^). 
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Therefore 

Pr[girth(G') < g] < llrredj ■ (2g/nY'^^3) < [d - ly ■ {2g/n)''/^^>^\ 
Setting g = c - A/?^log^Vlog(rf^n!) for any constant c < 1/2, yields the required result: 
Pr[girth(G) < g]) < exp(-fi( v/nlognlog(rf - 1) )). 

□ 

4. Random Cayley graphs of simple groups of Lie type 

Before proving Theorem H] in general (in section HTil) . we give an elementary proof for the 
group SL2(Fp) (in sect ion |4?T|) and discuss computer experiments (section I^y2|) and connection 
between girth and expansion (section 14.31) in the case of this group. 

4.1. Random Cayley graphs of SL2(Fp). We begin by giving an elementary proof of the 
lower bound on the girth of a random 2/c-regular Cayley graph of the group PGL2(Fp) for 
prime p; the proof for SL2(Fp) is similar. The Cayley graph is constructed with respect to 
the set S = {gf^, ■ ■ ■ ,gk^}, where d = 2k and gi, . . . ,gk are independent uniform random 
elements from PGL2(Fp). 

Theorem 8. As p ^ oo, a. a. s. the girth of the d-regular random Cayley graph of G = 
PGL2(Fp) orofG = SL2(Fp) is at least (1/3 - o(l)) ■ log^^i \G\. 

Before proceeding with the proof of Theorem [H] we recall the upper bound on the number 
of projective zeros of a polynomial. 

Theorem 9 (Serre [36], S0rensen [38] )• Homogeneous polynomial in m variables in Fp of 
degree d has at most dp'^~'^ + (p'"^^ — l)/(p — 1) projective zeros, and this is sharp. 

To prove Theorem [HI we start with the following lemma: 

Lemma 10. Let w be a word of length i in the free group JF^. If w is not identically 1 for 
every substitution of values from PGL2{p), then for a random substitution 

Ft[w = 1] < i/p + 0{p-'^) 

where implied constant depends on k only. 

Proof. The word w{gi, . . . , gk) evaluated in GL2 is a matrix whose entries are rational func- 
tions of the entries of the gi. The reason they are not polynomials is that w may contain 
inverses of the form g^^, so that a factor of 1/ det{gi) appears. Nevertheless, the equation 
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w = Jx constant (i.e. w = 1 in PGL) reduces to three homogeneous polynomial equations 
of degree i in 4k variables, corresponding to the equations an = 022, 012 = and 021 = 0. 

By our assumptions at least one of these equations is not identically zero. So by Theorem[9] 
it has at most £p^'^~^+0(p^'^~^) solutions among all possible matrices gi, . . . , g^, and therefore 
there are at most this many in the subset (GL2(p))'^. Since multiplication by constant 
matrices preserves solutions, it follows that there are at most £p'^^~^{p — + 0(p^*'~^) 
solutions to w = 1 in PGL2(p)^. Dividing by the fc-th power of |PGL2(p)| = p{p — l){p + 1), 
completes the proof. □ 

Proof of Theorem\^ Let d = 2k. The number of words of length i or less is at most (rf— 1)^^^. 
The probability of each such word is at most i/p + 0(p~^). So by the union bound all we 
need is that {d — lY~^^i/p = o(l), which holds if 

i = log,_iP - 21og,_i \og,_,p= (1/3 - 0(1)) ■ log2,_i \PGL2{p)\. 

We made the assumption that words of length i or less do not yield the identity for all 
substitutions; this follows from the following proposition: □ 

Proposition 11. For any k the length of the shortest non-trivial word w{xi, . . . ,Xk) such 
that fw{gi, . . . ,gk) = I for all gi,...,gk m SL(2, Fp) is at least Q{p/ logp). 

Proposition [TT] follows from Lemma [T^ and Corollary proved below. 

Lemma 12. The length of the shortest non-trivial word w{xi,X2) such that fw{gi,g2) = 1 
for all gi,g2 in SL(2,Fp) is at least p. 

(l 0^ 

Proof. Suppose we have a word in two generators g, h and let (7 = | , /i 

\x 1 

By a simple inductive argument, for all integers /i, fci, ...,/„, /c„ we have 




/ll /12 

^/21 f 22 + hkl ... InknX'^"' 

where /n, /12, /21, /22 are polynomials of degree at most 2n — 1. If the length of the word 
is less than p then all Zj and ki are less than p in absolute value, and hence liki . . . Inkn is 
not congruent to zero modulo p. Consequently we have that (/22 + liki . . . Inknx"^^) — 1 is a 
nontrivial polynomial of degree 2n, which has at most 2n roots. Since 2n is clearly also less 
than p, there is choice of x for which the polynomial is not zero modulo p; hence for such x 

we have that g^^h^^ g^'^h^^ . . . g^'^h^'^ ^ ( g 1 ) ^' 
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Similarly, we obtain that 

21 + hh . . . f22^ 

where f n, f 12, "^21, ^22 are polynomials of degree at most 2n; and apply the preceding argu- 
ment. □ 

Lemma 13. Let 00 be a non-empty length I reduced word in the k letters S = {gf^^gt^, • • • , gk^^}- 
Then for any k > k' > 2, one can find words tui, . . . ,ujk in the letters S' = {gf^, gf^, . . . , g^,^} 
so that the word uj' obtained from uj by substituting gf^ by ojf^ for i = 1, . . . ,k, does not 
reduce to the empty word. Moreover, one can find such words with \uji\ < 3 + 21og2fc/_i /log/. 

Corollary 14. Let I be the length of the shortest non-trivial word in two letters over the 
group G. Then the length of the shortest non-trivial word over G in any number of letters 
is at least Q{1 / log I) . 

Proof of LemmalWi Given the word u as above, we set uji = uJi^iXiUJi^R, where cjj.L and uJi^R 
are uniform independent random reduced words of length s, and Xi is chosen from S' so that 
no cancellations occur in uji. We claim that for a sufficiently large length s, the resulting 
word uj' does not reduce to the empty word with probability greater than zero. 

Suppose that uj' reduces to the empty word. Then, one can obtain the empty word 
from uj' by repeatedly deleting consecutive pairs of a letter and its inverse. Since, no letter 
Xi can cancel until one of the half-words uji^l or uji^r cancels, two half words that appear 
consecutively in the expanded word must cancel. Since two independent length s reduced 
words cancel with probability 2k' {2k' — and since there are only / — 1 consecutive 

pairs of half words in the expanded word uj', one obtains by union bound that uj' reduces to 
the empty word with probability at most l{2k'){2k' — 1)^'^'^^^^ which is less than one for the 
claimed value of s. □ 

4.2. Computer experiments for PGL2(p). In contrast to the permutation group Sn and 
the iterated wreath product Wn-, the size of PGL2(p) grows moderately with p. This allows 
getting some intuition on the asymptotic girth of PGL2(p) from computer experiments. 
We conducted experiments on random 4-regular Cayley graphs over G = PGL2(p). The 
experiments where conducted for varying primes p, where each experiment was repeated 
1000 times. In each case, our computer program either returned the length of the shortest 
cycle, or announced it to be greater than 30. 

In light of the experimental data, we make the following two conjectures: 
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Conjecture 15. The girth of such graphs is almost surely even. 

Conjecture 16. The girth of such graphs is (c+o(l)) log3 IGI, for some constant c, satisfying 
1/3 < c < 1. Furthermore, a.a.s. the girth is one of two consecutive even numbers. 

We give the following excerpt of our experimental data. As mentioned, for each value of 
p we computed the girth of 1000 random 4- regular Cayley graphs over PGL2(p). In the first 
table n^^^^ is the number of graphs with odd girth (out of thousand). 



p 


101 


331 


1009 


2003 


4001 


10007 20011 


40009 


100003 


^odd 


146 


138 


66 


42 


22 


16 8 


7 


1 



The second table lists, the number of times each even girth was attained (out of thousand). 
The two most abundant values of the girth where marked in bold. Normalizing these two 
values by dividing the girth by logg |PGL2(p)| yields: 0.85,0.95 for p = 1009; 0.87,0.95 for 
p = 10007; and 0.83, 0.89 for p = 100003. 



girth 


< 10 


12 


14 


16 


18 


20 


22 


24 


26 


28 


30 


> 30 


p = 1009 


52 


71 


111 


224 


295 


172 


9 

















p = 10007 


9 


7 


18 


38 


93 


198 


296 


296 


29 











p = 100003 








1 


5 


8 


39 


60 


148 


317 


342 


79 






4.3. Girth and expansion. In [13] it is shown that if Sp is a symmetric generating set for 
PSL2(Fp) {p prime) such that girth(^(PSL2(Fp), Sp)) > clogp, where c is independent of p, 
then ^(PSL2(Fp), Sp) form a family of expanders. Combined with Theorem H] this implies 
that Cayley graphs of PSL2(Fp) are expanders with respect to generators chosen at random 
in PSL2(Fp). The following conjecture, combined with the result in p3], would imply that 
Cayley graphs of SL2(Fp) are expanders with respect to any choice of generators. 

Conjecture 17. Suppose (Sp) = PSL2(Fp). There is a constant C, independent of p, 
satisfying the following property: the hall of radius C in the generating set Sp contains two 
elements g, h such that girth(^(PSL2(Fp), {g, h})) > \ogp. 

4.4. Proof of Theorem 14]. Note first, that, by [28], almost all d-tuples of elements in G{q) 
generate G as g ^ oo. 

Let d = 2k. Let be the free group on xi, . . . ,Xk. For w & F^ and a group G set 

Vu,iG) = {{gi, . . . ,gk) : gi e G, w{gi, ...,gk) = 1}. 
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Suppose G{q) is a Chevalley group, coming from the simple algebraic group G. The set 
Vw is an algebraic set in G''; by Borel's theorem [12] for a nontrivial word w, Vw is a. proper 
subvariety of G''. 

Set e = dimG. Then we have 

dimV^{G) <ke-l. 

Note however that V^(G') may well be a reducible subvariety. 

Now, suppose w has length at most /. We can view elements of G as matrices (in a 
natural way if G is classical, or using a minimal faithful module if G is exceptional). Then 
the requirement w{gi, . . . , gk) = I translates into polynomial equations of degree at most / 
in the matrix entries. Denoting by r the rank of G the number of such equations is bounded 
above by ar^ for some absolute constant a. To define Vw{G) over the affine space of matrices 
we need to add say /(r, k) fixed equations defining G^ there. 

It is known that an affine variety V of dimension D defined by m equations of degrees < / 
has at most l"^{q + 1)^ g-rational points. This follows from Bezout theorem and intersection 
theory. Moreover, the same applies if, instead of taking fixed points of Frobenius, we count 
solutions to x'^ = h{x), which define the finite twisted groups of Lie type. See Section 10 of 
Hrushovski [19j for these facts. 

Combining this with the information in the previous paragraph regarding we obtain 

\VUGiq))\<hr\q+ir-\ 

where bi = 6i(r, k) depends on r and k. Since ~ q^ we have 

(7) \VUG{q))\/\G{q)'\<br'/q, 

where b = b{r, k) is a constant. 

Now noting that the expression on the left-hand side of ([7]) is the probability of the word 
w being equal to identity and applying the union bound of section [2] completes the proof of 
Theorem |H 

5. Girth for j9-groups and toy genetics 

The symmetric p-group Wn{p) of height n is the n-fold iterated wreath product of Z/(p Z). 
It is isomorphic to the Sylow p-subgroup of the symmetric group Sym(p"'). Also, it is 
isomorphic to the automorphism group of the height n rooted p-ary tree. The group Wn 
plays the analogous role to the symmetric group in the realm of p-groups: It is a basic 
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family of groups containing all finite p-groups as subgroups. The size of this group satisfies 

In this section, we study the girth of the symmetric p-group; we restrict our attention 
to p = 2, as it is conceptually and notationally more clear. Analogous results hold for 
other primes p. The symmetric 2-group is also the graph automorphism group of the rooted 
binary tree of height n. Each element g of Wn = Wn{2) can be written as {gi, (72) Xg j where 
gi G Wn-i are elements of the automorphism groups of the two subtrees Ti,T2 of T with 
roots at level 1, and Xg e Z/(2Z) either switches Ti and T2 (active) or equals the identity 
(inactive). 

We start with a word w in some letters a,b,... and their inverses a^^ = a,b^^ = b. . .. 
Assume that the values of , xj, , . . . are known. Then wi and W2 can be expressed in terms 
ai,a2,bi,b2 . . .. 

It is also clear that if g is a uniform random element in Wn, then gi,g2 G M^n-i and Xg 
are independent uniform choices. 

The following toy genetics model describes the way words Wi and W2 (and their recursive 
offsprings) are determined. 

A toy genetics model. Here we describe a biologically incorrect model for the genome 
evolution of a strictly asexual organism, henceforth referred to as an "amoeba" . 

The DNA of an amoeba is a sequence of length I of "forward" bases, and their inverses, or 
"backward" pairs. Backward and forward versions of the same base cannot be next to each 
other in the DNA. 

At each integer time, each amoeba undergoes fission into two offspring, and its DNA is 
inherited as follows. First, two fresh copies of the DNA are created. Then, "crossing over" 
symbols are introduced as follows. Each pair of forward and backward bases introduces 
its own crossover symbol into the sequence: the forward alleles after their occurrence; the 
backwards ones, before. 

Each crossover symbol is active or inactive, with equal probability, independently of others. 
Crossovers happen at the active symbols. 

For example, starting with the word w — ahcaac (where a — denotes the backward 
pair of a) the two fresh copies are aifciCiaiaiCi and a262C2a2Q2C2- With the introduction of 
the crossover symbols, the word becomes 

ahcaac =^ Xadbx^c 
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Say the random settings activate the symbols and Xb , but not Xc . Then the DNA of the 
two offspring are: 

Wi = d2b2Ciaia2Ci, 
(8) W2 = aibiC2a2aiC2. 

We are interested in how fast the DNA diversifies. Call an amoeba free if its DNA consists 
of all different bases. Starting from a given DNA w, how many generations does it take until 
a free amoeba is born? In the above example I = \uj\ = 6 and u has 3 different bases. After 
one generation uJi and uj2 have 5 different bases each, so they are not free. 

Heuristic. Very roughly speaking, in each generation, the number of bases doubles. Thus 
within a logarithmic number of steps, an amoeba should emerge with all different bases in 
its DNA. 

Conjecture 18. There exists a constant c> so that starting with any DNA configuration 
of length n, the probability that there is an amoeba at generation clogn with all different 
bases in her DNA is at least 1/2. 

In fact, there is a simple conjecture that would imply this and more. We call an integer- 
valued function from the space of words a complexity function if it satisfies the following 
properties. Note that wi and W2 denote the random DNA of the offspring as in ([8]). 

(1) X{w.i) < for 2 = 1,2 

(2) < iff w consists of different bases 

(3) Given w with xi'^) > 1 we have Pr[min(x(wi), x(w2)) < xi'^) ~ 1] ^ 1/2- 
We define 

Xii) = sup 

\w\<e 

It is not true that the number of bases in the DNA doubles in each generation with fixed 
probability. But we believe that there "the log number of bases" can be replaced by some 
other function of the DNA so that we get this behavior. 

Conjecture 19. There exists a complexity function x with x(^) < piogi for a fixed P > I 
and all n > 1. 

Here we show that 

Lemma 20. The function xi'w) = \M ~ rn{w), where m{w) is the number of distinct bases 
used by w is a complexity function. 
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Clearly, x(0 = ^ ~ 1) ^^'^ the first two properties are satisfied. For the third, it suffices to 
prove the following. 

Lemma 21. The probability that a given offspring of a given amoeba with some fixed DNA w 
has at least one more base in her DNA than her parent (given that the parent has a repeated 



Proof. We consider a repeated base, say a, for which the two repetitions are closest to each 
other in the DNA. 

If they have the same orientation, then at the time of fission there will be a single crossover 
symbol Xa in between the two. Given the values of all the other crossover symbols, this 
symbol is independent and random, and is active with probability 1/2. Thus, in the child, 
the first occurrences of a are from the same copy of the DNA or a different copy, with 
probability 1/2 each. 

If they have different orientation, then there must be at least another base, say 6, in 
between the two occurrences; there, b has to appear a single time, otherwise the pair of a's 
could not be closest. Thus a single 6-crossover symbol appears between the pair of a-s. 
The proof concludes as in the first case, except we condition on the value of all crossover 
symbols but x;, . □ 

Lemma 22. If x is a complexity function, then the probability, starting with DNA w, that 
there is no free amoeba at generation n is at most pi{n, \w\), where: 



Proof. Let £ = \w\. We consider the evolution of the DNA w = w{0) let w{n + 1) be 
the the one of the two children of w{n) with lower complexity. By property (3) of the 
complexity function, the process xi'w{n)) is stochastically dominated by — Sn, where 
Sn is a binomial(n, 1/2) random variable, i.e. the sum of n independent random variables 
taking the values {0, 1} with probability 1/2 each. For such independent "coin tosses", we 
have the well-known Chernoff (large deviation) bound 



base) is at least 1/2. 




P[Sn < 7^/2] < e 



.n(l-7)V4 



Setting jn/2 — x{\w\) completes the proof. 



□ 



After this brief digression into genetics we turn our attention to the symmetric 2-groups. 
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Proposition 23. There exists (3k > so that for any word w of length at most i = [/^^nj 
in k generators we have 

P{w = 1 zn Wn) = o{{2k-lY). 
As a consequence, for k random generators, we have 

girth(M4) > Pk log2 log2 \Wn\ a.a.s. 

Proof. Let no < n. By Lemmas [20l and [22l with probability l—pi{nQ, i) there is a free amoeba 
at generation uq. 

If level no of the tree is not fixed by w, then w ^ 1 in Wn and we are done. If it is fixed by 
w, then the DNA w' of the free amoeba describes the action of w on T', one of the subtrees 
of height n — Uq rooted at level uq . 

Since all bases in w' are different, when the random evaluation of w' gives a uniform 
random element of Aut(T'). Thus the conditional probability of = 1 in Wn is at most 

P2{n - no) = P{w' = 1 in Aut(r)) = | Aut(T„_„J|-i. 

Given some value of i, we set no, n so that both pi (no, ^) andp2('^— ''^o) are o{{2k—iy^). First, 
we set no = a^l where is a sufficiently large constant to make pi small. It is not difficult 
to verify that > 4(log(2/c — 1) + 1) suffices. Second, we take n — no sufficiently large so that 
p^"^ = iWn-nol ^ {2k — ly. Here the situation is much better, and n — no = 9(log£) suffices. 
Putting the two bounds together yields the lemma, for any < [4(log(2A; — 1) + 1)]^^- □ 

It was shown in [3] that typical elements have order 2"""'"°(^) with a < 1. This implies the 
following. 

Proposition 24. Even for a single random generator, we have girth(Wn) < (log 
a.a.s. 

So the symmetric two-group gives an interesting example of intermediate girth groups. 
Based on the heuristic argument before Conjecture [TSl we have 

Conjecture 25. For k random generators, there is (3 = (3k so that we have girth(W,i) = 
(log|W^„|)^+°(i) a.a.s. 

Another conjecture by Abert and the last author is closely related to this problem. 

Conjecture 26. Let w he a word of length 2™. If m < n, then P{w = 1) < 1 in Wn. 

The m < n condition is sharp, as w = a^" is satisfied by all elements in Wn- 
In general, we believe that the power word is the easiest to satisfy. 
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Conjecture 27. Let w be a word of length 2™. Then P{w = 1) < P{a^"' = 1) in Wn- 

If true, this conjecture implies that the length of the shortest non-trivial word satisfied in 
Wn is 2". 

We leave it for the reader to check that this conjecture implies that the upper bound in 
Proposition [24] has a lower bound of the same form (with a different constant a'). 

6. NONCOMMUTATIVE DIOPHANTINE PROPERTY 

In closing we mention the continuous analog of the notion of large girth suitable for 
elements in the group ring of a compact group. It was introduced in [17| (with G = SU{2)) 
and called there noncommutative diophantine property. 

Definition 28 ([17J). For k > 2, we say that gi, g2, . ■ . , Qk G G satisfy noncommutative 
diophantine property if there is a D = D{gi, . . . ,gk) > such that for any m > 1 and a 
word Wjn in gi, g2, ■ ■ ■ ,gk of length m with 7^ e (where e denotes the identity in SU(2) ) 
we have 

(9) l|W^m-e|| > D~^. 

Here 

2 

= \a\' + \b\' + \c\' + \d\'. 

Recall that G M is called diophantine if there are positive constants Ci, C2 such that for 
all {k, I) e 1? with A; 7^ we have \ke -l\> Cik-^\ Equivalently, letting g = e^^^ G SO (2), 
we may reexpress this condition as follows: — 1| > C[k~^2, A classical result |Mj asserts 
that diophantine numbers 6 are generic in measure in M. Given diophantine 61, ... ,6k and 
gi = e'^'^^^ , . . . , gk = e^'^^'^ G SO (2), for any word W in gi,...,gk of length m we have 
\Wm — 1| > Cim~'-''^ for some Ci, C2. In the case of S0(3), given gi, . . . ,gk generating a free 
subgroup, a pigeonhole argument shows that for any m > 1 there is always a word W in 
gii gi^ 1 ■ ■ ■ 1 gki Qk^ of length at most m satisfying 

11^ -^11 ^ (2fc-l)W6' 

so the exponential behavior in the definition above is the appropriate one. 
As was first exploited by Hausdorff, for G = SU{2) [18] the relation 

Wmigi, gi\ ■ ■ ■ , gk, 9k^) = e 



a b 
c d 
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where Wm is a reduced word of length m > 1 is not satisfied identically in G^''\ Hence the 
sets 

V{Wm) ■■= {{gi,...,gk)\Wm{g)=e} 

are of codimension at least one in It follows that U„^>iV [Rm) is of zero measure m 

G'^''^ and also it is of the first Baire category in G^''\ Thus the generic {gi, . . . , g^) E G^'^^ (in 
both senses) generates the free group. 

This holds quite generally: for G connected, finite-dimensional non-solvable Lie group it 
was proved by D.B.A. Epstein [15] that for each k > 0, and for almost all fc-tuples {gi, . . . , gk) 
of elements of G, the group generated hy gi, . . . , gk is free on these k elements. 

Now the set of {gi,...,gk) E G^''^ for which {gi,...,gk) is not free is clearly dense in 
G'^'^^ so it follows easily that the set of {gi, . . . ,gk) E G^^^ which are not diophantine is of 
the second (Baire) category in G^'^'^. That is to say the topologically generic ((^i, . . . ,gk) is 
free but not diophantine. On the other hand in [T7j it was proved that the elements with 
algebraic number entries are diophantine and the following conjecture was made: 

Conjecture 29. Generic in the measure sense {gi, . . . ,gk) is diophantine. 

Kaloshin and Rodnianski [23] established the following result towards conjecture [22] for 
almost every pair {A, B) E 5*0(3) x 5*0(3) there is a constant D > such that for any n and 
any word Wn{A, B) of length n in A and B the following weak diophantine property holds: 

\\Wn{A,B)~e\\>D-^\ 
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